Pattern Discovery for Multiple Data Sources Based on Item Rank

نویسندگان

  • Arti Deshpande
  • Anjali Mahajan
چکیده

Retail company’s data may be geographically spread in different locations due to huge amount of data and rapid growth in transactions. But for decision making, knowledge workers need integrated data of all sites. Therefore the main challenge is to get generalized patterns or knowledge from the transactional data which is spread at various locations. Transporting data from those locations to server site increases the cost of transportation of data and at the same time finding patterns from huge data on the server increases the time and space complexity. Thus multi-database mining plays a vital role to extract knowledge from different data sources. Thus the technique proposed finds the patterns on various sites and instead of transporting the data, only the patterns from various locations get transported to the server to find final deliverable pattern. The technique uses the ranking algorithm to rank the items based on their profit, date of expiry and stock available at each location. Then association rule mining (ARM) is used to extract patterns based on ranking of items. Finally all the patterns discovered from various locations are merged using pattern merger algorithm. Proposed algorithm is implemented and experimental results are taken for both classical association rule mining on integrated data and for datasets at various sources. Finally all patterns are combined to discover actionable patterns using pattern merger algorithm given in section V.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Group Pattern Discovery Systems for Multiple Data Sources

INTRODUCTION Multiple data source mining is the process of identifying potentially useful patterns from different data sources, or datasets (Zhang et al., 2003). Group pattern discovery systems for mining different data sources are based on local pattern-analysis strategy, mainly including logical systems for information enhancing, a pattern discovery system, and a post-pattern-analysis system.

متن کامل

Using Multiple-Variable Matching to Identify EFL Ecological Sources of Differential Item Functioning

Context is a vague notion with numerous building blocks making language test scores inferences quite convoluted. This study has made use of a model of item responding that has striven to theorize the contextual infrastructure of differential item functioning (DIF) research and help specify the sources of DIF. Two steps were taken in this research: first, to identify DIF by gender grouping via l...

متن کامل

Methods for the Efficient Discovery of Large Item-Indexable Sequential Patterns

An increasingly relevant set of tasks, such as the discovery of biclusters with order-preserving properties, can be mapped as a sequential pattern mining problem on data with item-indexable properties. An item-indexable database, typically observed in biomedical domains, does not allow item repetitions per sequence and is commonly dense. Although multiple methods have been proposed for the effi...

متن کامل

Domain-Aware Multi-Truth Discovery from Conflicting Sources

In the Big Data era, truth discovery has served as a promising technique to solve conflicts in the facts provided by numerous data sources. The most significant challenge for this task is to estimate source reliability and select the answers supported by high quality sources. However, existing works assume that one data source has the same reliability on any kinds of entity, ignoring the possib...

متن کامل

Determination of the Parameters of Six Multiple Choice Tests of Mashhad University of Medical Sciences (1389-90) based on Item-Response Theory (IRT)

Background: According to the industrialization of countries and development of societies, tests and methods are required to employ people in industries and organizations and make the best selection in getting workforce. Interviews, Written tests  , and multiple choice tests are common methods used in employing people. Among these methods  , multiple choice tests is the easiest one because of th...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2017